1. NOTEThis document is a collection of equations and expressions that are found throughout the course. All context and background information are absent and only the name of the object is presented. The reader is advised to proceed with extreme caution while engaging with this document.2. Week-1Data matrixX∈Rd×dMean of the data-points⏨x=1
nn∑i=1xiTo center data-pointsx′i=xi-⏨xCentered data matrixXc∈Rd×dXc=a
|
|
x′1
⋯
x′n
|
|
Xc is the centered data-matrix of shape d×n and we will call this X from now.Covariance MatrixWe assume a centered dataset.C∈Rd×d Outer-product formC=1
nn∑i=1xixTi Matrix-formC=1
nXXT Scalar formCpq=1
nn∑i=1xipxiqCpp=1
nn∑i=1x2ipProjection of x onto unit vector w(xTw)wScalar Projection of x onto unit vector wxTwError vector for one data-pointe=x-(xTw)wReconstruction error for one data-point||e||2=||x-(xTw)w||2Reconstruction error for the entire dataset1
nn∑i=1||xi-(xTiw)w||2Error Minimization
min
w||w||=11
nn∑i=1||xi-(xTiw)w||2is the same as
min
w||w||=11
nn∑i=1||xi||2-(xTiw)2Variance of dataset along w1
nn∑i=1(xTiw)2is the same aswTCwVariance Maximization
max
w||w||=11
nn∑i=1(xTiw)2is the same as
max
w||w||=1wTCwFirst Principal Componenta
max
w||w||=1wTCw
=𝜆1
argmax
w||w||=1wTCw
=w1
ith Principal Componenta
max
w||w||=1wTw1=0,⋯,wTwi-1=0wTCw
=𝜆i
argmax
w||w||=1wTw1=0,⋯,wTwi-1=0wTCw
=wi
Projection of xi on subspace spanned by top-k PCs.(xTiw1)w1+⋯+(xTiwk)wkPrincipal component matrixW∈Rd×kW=a
|
|
w1
⋯
wk
|
|
Dimensionality reduced datasetX′∈Rk×nX′=WTXCompression ratio for dimensionality reductionDefined as new/old:nk
nd=k
dReconstruction in RdX′∈Rd×nX′=WWTXReconstruction error1
nn∑i=1aaxi-k∑j=1(xTiwj)wjaa2Compression ratio for reconstruction in RdDefined as new/old:k(n+d)
nd=k
d+k
nTotal variance𝜆1+⋯+𝜆dChoice of k, % variance captured𝜆1+⋯+𝜆k